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ABSTRACT 

ON 

It is well known that estimating the mean pairwise velocity of galaxies, v 12 , from 
the redshift space galaxy correlation function is difficult because this method is highly 
sensitive to the assumed model of the pairwise velocity dispersion. Here we propose 
Q | an alternative method to estimate v 12 directly from peculiar velocity samples, which 

contain redshift-independent distances as well as galaxy redshifts. In contrast to other 
dynamical measures which determine (3 = Q°' 6 a s , this method can provide an estimate 
of f2°' 6 cr 8 2 for a range of <r g where Q is the cosmological density parameter, while cr 8 is the 
standard normalization for the power spectrum of density fluctuations. We demonstrate 
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l/") . how to measure this quantity from realistic catalogues. 

(N . 

Subject headings: Cosmology: theory - observation - peculiar velocities: large scale 
. flows 

1. Introduction 

h 

In this Letter we investigate the possibility of using the "mean tendency of well-separated 
galaxies to approach each other" (Peebles 1980, hereafter LSS) to measure the cosmological density 
parameter, 0. The statistic we consider is the mean relative pairwise velocity of galaxies, v 12 . It 
was introduced in the context of the BBGKY theory (Davis and Peebles 1977), describing the 
dynamical evolution of a collection of particles interacting through gravity. In this discrete picture, 
v 12 is defined as the mean value of the peculiar velocity difference of a particle pair at separation 
r (LSS, Eq. 71.4). In the fluid limit, its analogue is the pair-density weighted relative velocity 
QFisher et al. 1994 |Juszkiewicz et al. 1998a|) , 
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where v A and 5 A = p A j (p) — 1 are the peculiar velocity and fractional density contrast of matter at 
a point r A , r = \r 1 — r 2 \, and £(r) = (5i<5 2 ) is the two-point correlation function. The pair-weighted 
average, (• • -) p , differs from simple spatial averaging, (• • •), by the weighting factor p 1 p 2 {piP 2 } , 
proportional to the number-density of particle pairs. In gravitational instability theory, the mag- 
nitude of v 12 (r) is related to the two point correlation function, £(r), through the pair conservation 
equation (LSS, Eq. 71.6). For models with Gaussian initial conditions, the solution of the pair 
conservation equation is well approximated by flJuszkiewicz et al. 1998b ) 



v 12 (r) = -§#r/£(r)[l + a£(r)] , (2) 

£(r) = (3/r 3 ) f x 2 dx = f (r) [ 1 + £(r) ] (3) 
J 

Here a is a parameter, which depends on the logarithmic slope of £(r), while / = dlnD/dlna, 
with -D(a) being the standard linear growing mode solution and a - the cosmological expansion 
factor (see e.g., LSS, §11). Finally, H = 100 h km s -1 Mpc is the present value of the Hubble 
constant. For a wide class of cosmological models, including those with non-zero cosmological 
constant, / ps O ' 6 ([Peebles 1993| , hereafter PPC, §13). If £ oc , and 7 is scale-independent, 
the predicted streaming velocity can be expressed in terms of f2 and the standard normalization 
parameter, cr 8 - the rms matter density contrast in a ball of radius 8/i _1 Mpc. For a pure power-law 
f (r), we have (PPC, Eq. 7.72) 

£( r ) = 3£(r)/(3- 7 ) = cr 8 2 (16 /i" 1 Mpc/r) 7 (4 - 7 )(6 - 7 )/24 . (4) 

For < 7 < 3, the parameter a is given by flJuszkiewicz et al. 1998b| ) 

a w 1.2 - 0.657 • (5) 

The approximate solution of the pair conservation equation, given by equations © - © accurately 
reproduces results of high resolution N-body simulations in the entire dynamical range ( |Juszkicwicz 



et al. 1998b| ). This approximate solution was designed to reproduce the second-order Eulerian 



perturbation theory solution in the weakly nonlinear regime ( |£| < 1 ) and the stable clustering 
solution in the strongly nonlinear regime (£ S> 1, v 12 (r) = —Hr ; see [Scoccimarro fc Frieman 1996| 



and Lokas et al. 1996 for the second-order correction for £ and LSS, §71 for the stable clustering) 



To get a better idea of how the Equation (g) can be used to estimate Q, let us consider a 
numerical example: v 12 at a separation r = \0fi~ 1 Mpc . One can use the APM catalogue of galaxies 
(Efstathiou 1996) for an estimate of 7. The slope at the separation considered is 7 = 1.75 ± 0.1 
(the errors we quote are conservative). Substituting Eqs. || and || into Eq. ^, and setting 7 = 1.75, 
we get 

v^Wh- 1 Mpc) = -605a 8 2 fi - 6 (l + 0.43fj 8 2 )/(l + 0.38fj 8 2 ) 2 km/s . (6) 

The above relation shows that at r = 10/i -1 Mpc, v 12 is almost entirely determined by the values 
of two parameters: cr 8 and £1. It is only weakly dependent on 7. This dependence is caused by the 
ct^ 7 term in Eq. ^. However, for all realistic values of 7, a is a small number. The uncertainties in 
the observed 7 lead to an error in Eq. I of less than 10% for a 8 < 1. 
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The approximate solution, given by Equation || accurately reproduces v 12 (r) curves for dark 
matter particles, measured from high resolution N-body simulations ( Juszkiewicz et al. 1998b| ). 
Moreover, Eq. ^ agrees well with measurements of mean streaming velocities of "galaxies", v 12g (r) , 
obtained in recent simulations, which attempt to take into account non-gravitational processes like 
star formation and radiative cooling ( [Kaufmann et al. 199£ ). These simulations exhibit clustering 
bias, but no velocity bias (v 12g = v 12 ), suggesting that galaxies constitute reliable test particles, 
driven by the gravitational field of the true mass distribution even if the galaxies themselves are 
biased tracers of mass. 



2. The estimator 

Since we observe only the line-of-sight component of the peculiar velocity, s A = r A -v A /r = 
r A ■ v A , rather than the full three-dimensional velocity v A , it is not possible to compute v 12 directly. 
Instead, we propose to use the mean difference between radial velocities of a pair of galaxies, 
(s 1 — s 2 ) p = v 12 r ■ (fi + r 3 )/2, where r = r x — f 2 . To estimate v 12 , we use the simplest least 
squares techniques, which minimizes the quantity x 2 ( r ) = J2 AB [( s a — s B ) — p AB v 12 (r)/2] 2 , 
where p AB = f ■ (f A + r B ) and the sum is over all pairs at fixed separation r = \r A — r B \. The 
condition d\ 2 / dv 12 = implies 



2 E (sa ~ s b )Pab 



v 12 (r) = 7^ 4a . (7) 



This estimator is appropriate to be applied to a point process which will sample an underlying 
continuous distribution. The sampling is quantified in terms of the selection function, <j>(f). The 
continuum limit of Eq. [?] is then 

~ , , _ 2 J dm 1 dm 2 ^> 12 (s^ - s 2 )p 12 

Vl2[r) ~ fd mi dm 2 <S> 12Pl2 i ' W 

with dm A = p A d 3 f A , and a two-point selection function given by <£ 12 = 8d(\t\ — r 2 \— r) 0(r x ) 4>(r 2 ) , 
where 5n is the Dirac delta function. For ease of notation we shall denote the denominator in Eq. || 
by W(r). If we take the ensemble average of Eq. M we find that (v 12 (r)) = v 12 (r). Note that, unlike 



the estimators for the velocity correlation tensor proposed in Gorski et al. 1989, the ensemble 
average of the estimator is v 12 (r) independent of the selection function. For an isotropic selection 
function this estimator is insensitive to systematic effects such as a bulk flow, large scale shear and 
small scale random velocities (as one might expect from virialized objects). 

To assess how useful this statistic is in practice we calculate the covariance matrix of v 12 (r); 
this involves calculating the ensemble average, ((s 1 — s 2 )(s 3 — s 4 )) p = ((s 1 — s 2 )(s 3 — s 4 ))+ higher 
order terms. Unlike most statistics, the number weighting leads to a variance which is of the 
same order in perturbation theory, 0(5 2 ), as the actual quantity one is trying to estimate. The 
covariance between estimates of the pairwise velocity at two different separations, v 12 = v 12 (r), and 
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v' 12 = v 12 (r'), can be expressed as 

(v 12 v[ 2 ) - (v 12 )(v[ 2 ) = J d 3 r 1 d 3 r 2 ^(r 1 )0(r 2 )^(r,r 1 )^(r / ,f 2 )r u ^ij(r)r 2j , (9) 

where r Aj is the j th cartesian component of r A (A = 1,2; j = 1,2,3), while VPy(r) = (vi(0)vj(r)} 
is the velocity correlation tensor, and fi(r, fj) = 2W~ 1 (r) f d 5 r 2 (f)(r 2 )5r)(\r 2 — f x \ — r)p 12 . In the 
linear regime, Vl/^ can be expressed in terms of P(k) — the power spectrum of density fluctuations 
(IGorski 1988| , |Groth et al. J98| ), % (f) = (^/ 2 /8vr 3 ) / d 3 kP(k) (hkj/k 4 ) exp(ik • f). The 



form of the selection function will dictate the dependence of the variance on scale. As one would 
expect, the smaller the depth of the selection function, the larger the variance. This is illustrated in 
Figure |l](a,b) where we plot the mean (dotted line) and variance of v 12 (r) for two COBE normalized 
CDM models and for a choice of two selection functions. In this Letter we shall use a selection 
function of the form 

0(r) oc [r(l + (r/r,) 2 )]- 1 ; r* = 30 h^Mpc , (10) 

which we shall refer to as the "full" selection function (plotted as the solid line in Figure [l]). In 
many cases galaxy catalogues will have a sharply decaying selection function beyond a certain scale 
(Strauss & Willick 95) and it is therefore useful to check the effect such a feature will have on our 



estimator. We shall do so by considering the selection function of Eq. 10 truncated at r = r*. We 
refer to the latter as "truncated" (dashed line in Figure [l]). 

A very important feature of this statistic is the possible presence of non-negligible covariance 
between values of the estimator at different scales. The fact that the covariance depends on the 
velocity correlation tensor, ^, will lead to larger covariance than what one might naively expect: 
the larger coherence length of this quantity (as compared to either £(r) or v 12 (r)) leads to a larger 
coherence in the covariance matrix and consequently to larger cross correlations between v 12 at 
different scales. In Figure |l](c,d), we plot the appropriately normalized covariance, Cov n (v 12 ,v' 12 ) = 
[(v 12 v[ 2 ) — {v 12 )(v[ 2 }}/ \/{v x2 )(v' 12 2 ) for a range of separations from 10/i -1 Mpc. An open universe 
has a longer coherence length then the flat universe, and therefore a stronger covariance; also we 
see that for a shallow <j)(r), the correlations between the estimates of v 12 {r) will be large. 



3. Tests with mock catalogues 

To test the reliability of the results derived above we now apply our statistic to mock catalogues 
extracted from N-body simulations of a dust-filled universe with 17 = 1 and P(k) oc 1/k. We use 
one realization of this model universe in a box which is 235/i _1 Mpc on a side and is normalized 
to cr 8 =0.7. From this box we extract sets of mock catalogues following the procedure described in 
Davis, Nusser &: Willick (1996) however we emphasize several features. Small-scale velocities have 
been suppressed to ~ 200 km s _1 ; this is not a self-consistent procedure and will lower the amplitude 
of Eq. [2] by ~ 10%. In exchange for this relatively small inaccuracy, our mock catalogues reproduce 
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the observations and observational errors more faithfully. Our mock observers are centered on 
particles moving at 600 km s _1 with small local shear; i.e., resembling conditions in the Local 
Group. In dense regions, the redshift fingers of god have been collapsed as is done in the Mark III 
(Willick et al 1997) and IRAS (Fisher et al 1994) catalogues. A typical catalogue will have between 
6000 to 11000 galaxies. 

In Figure ^(a) we plot v 12 (r) with one standard deviation calculated with 20 mock catalogues 
extracted with the full 4>(r) as described in the previous section. Each catalogue has a different 
observation position within the simulation volume and so an average over this set should resemble a 
true ensemble average. The mean is consistent with what one would expect from a direct calculation 
with Eq. |2| (which is plotted in Figure |2|(a) as a solid line). We have also performed this analysis 
without collapsing the cores; the results changed by very little. 

We repeat this calculation for a set of 9 catalogues all constructed from the same observation 
point using the full (Figure ^b) or truncated (Figure |2]c) selection function to randomly sample a 
fraction of galaxies within the simulation box. The variance in v 12 (r) is now solely due to finite 
sampling ("shot noise"); for catalogues with 2000 to 3000 galaxies we expect the variance to be 
\/2—^/3 times larger. We find that truncating the selection function changes the functional form, or 
slope, of the mean, making it a more sharply decreasing function of r than the ensemble average. 
It is therefore crucial when analyzing a catalogue to restrict oneself to scales much smaller than 
the effective cutoff scale of the selection function. 

Next we will address the impact of errors in distance measurements on the estimator, v 12 . 
Presently the best estimators use empirical correlations between intrinsic properties of the galaxies 
and luminosities. The errors in such methods lead to log- normal errors in the estimated distance of 
around 20% (for a clear description see Landy & Szalay 1992). These errors will naturally lead to 
biases in cosmological estimators involving distance measurements and peculiar velocities and are 
generically called Malmquist bias. There are formal prescriptions for correcting for these biases but 
they rely on assumptions about the correlations between errors in the distance measurement and 
the selection function. Clearly this should be addressed on a case-by-case basis. For the purpose of 
this Letter we shall assume no correlation between the distance estimator and the selection function. 

We shall model our errors assuming a Tully-Fisher law which resembles that inferred from the 
Mark III catalogue. The line width, 77, and absolute magnitude, M, are related by r] = eM+jyo with 
e = —0.1 and r/Q = —2. The line width obeys a Gaussian distribution with = 0.05 which lead to 
log- normal variance in the distance estimator, d, of a\ n d = 23%. Using one set of galaxies extracted 
from the simulation box we generate one hundred catalogues with these galaxies assuming random 
errors in the distance measurement according to the above distribution. To assess the importance 
of Malmquist bias we first evaluate v 12 using the raw (uncorrected) distances. To correct for 
Malmquist bias we use the prescription put forward in Landy & Szalay (1992): we correct the raw 
distance, cIr, to get the true distance, dt = dRexp(3.5af nd )4> r (exp(crf nd )dR) /<j)R(dR), where 4>r is 
the selection function estimated from the raw distances. In principle, given our assumptions, this 
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should correct for Malmquist bias. 

In Figure |3](a), we plot the results for the uncorrected simulations; Malmquist errors system- 
atically lower the values of v 12 on small scales while enhancing its amplitude on large scales (where 
the effect should be more dominant). However in Figure ||(b) we show that with the correction 
for general Malmquist errors to the distance estimator, it is possible to overcome this discrepancy. 
The l-o" errors now encompass the true v 12 over a wide range of scales. The Malmquist effect is 
more obvious in Figure f| where we plot the distribution of {^(lO/i -1 Mpc) for 1000 realizations 
with and without correction for Malmquist errors. If uncorrected, these errors will induce a bias of 
up to 30% in v 12 and lead to an underestimate of ^l°' e a s 2 . If properly accounted for, one can see 
from Figure |I] that this bias can be easily overcome. 



4. Discussion 

In this Letter we propose to estimate the mean pairwise streaming velocities of galaxies directly 
from peculiar velocity samples. We argue that it is a powerful measure of Q°' e a 8 2 . Combined with 
other dynamical estimates of f2°' 6 <7 8 this allows a direct estimate of CI. Our simulations show that 



this method is more robust than the £(r p ,7r) analyses of redshift catalogues (Fisher et al. 1994 
and references therein) because unlike the redshift space correlation function, v 12 is not sensitive 
to the presence of rich clusters of galaxies in the sample. Moreover, for v 12 , the random velocity 
errors average to zero instead of adding in quadrature as in the £(r p ,7r) method which estimates 
the pairwise velocity dispersion. 

We identified three possible sources of systematic errors in estimates of v 12 made directly from 
radial peculiar velocities of galaxies. We also found ways of reducing these errors; these techniques 
were successfully tested with mock catalogues. The potential sources of errors and their proposed 
solutions can be summarized as follows. 

(1) On the theoretical front, assuming a linear theory model of v 12 (r) at r ~ lOh^ 1 Mpc can 
introduce a considerable systematic error in the resulting estimate of a 8 2 Cl ^. For example, if <7 g = 1 
using the linear prediction for v 12 at r = Wh^ 1 Mpc would introduce a 25% systematic error (see 
eq. Q). We solve this problem by using the nonlinear expression for v 12 , derived by Juszkiewicz et 
al. (1998b). 

(2) On the observational front, a shallow selection function induces a large covariance between 
v 12 on different scales. This must be taken into consideration by measuring v 12 (r) only on sufficiently 
small scales. A rule of thumb is that for estimating v 12 at 10/i -1 Mpc, the selection function should 
be reasonably homogeneous out to at least 30/i _1 Mpc. 

(3) Finally, care must be taken with generalized Malmquist bias due to log-normal distance 
errors; these induce a systematic error in v 12 . We have shown that, under certain assumptions about 
selection and measurement errors, the method of Landy & Szalay (1992) for corrected distance 
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estimates allows one to recover the true v 12 . Naturally, this particular correction must be addressed 
on a case- by-case basis, given that different data sets will have different selection criteria and 
correlations between galaxy position and measurement errors. 

In a future publication we shall analyze the Mark III (Willick et al. 1997) and the SFI (da 
Costa et al. 1996) catalogues of galaxies with this in mind. 
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Fig. 1. — Top panels: v 12 (r) (dotted line) and its variance for both the full (solid line) and the 
truncated (dashed line) selection function for CDM spectra with f2 = 0.3 (a) and $1 = 1 (b) 
calculated using Eq. [2] with a = 0. The fluctuations are normalized such that J7°' 6 <7 8 2 = 0.49. 
Bottom panels: the normalized covariance function for CDM models with 0, = 0.3 (c) and f2 = 1 
(d). The solid (dashed) line corresponds to the full (truncated) selection function. 
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Fig. 2. — v 12 (r) (points) and its variance (dashed lines) evaluated from mock catalogues described 
in the text: A) random observers with the full selection function compared to v 12 (r) evaluated 
from Eq. ^ (solid line); b) A fixed observer with full selection function; c) a fixed observer with a 
truncated selection function. The variance is estimated from the scatter over 20 (a) or 9 (b,c) mock 
catalogues. 




Fig. 3. — v 12 (r) and its variance evaluated from 100 mock catalogues with errors (described in 
the text) and the full selection function. The solid points are the v 12 of the error-free simulation 
seen from the same observation point, the solid line is the mean and dashed lines are the 1 a. a) 
uncorrected distances; b) distances corrected for Malmquist bias 
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Fig. 4. — The effect of generalized Malmquist bias on {^(lO/i -1 Mpc). The thick vertical line is the 
true value, the dashed histogram is the uncorrected estimate and the solid histogram is corrected 
for Malmquist bias. This is taken from 1000 realizations of log-normal errors (as described in the 
text). The histograms have been normalized to unity 



